WHAT IS CLAIMED IS: 

1. An apparatus for controlling a movable robot 
comprising a camera, moving means, and a device for outputting 
a sound, which comprises: 

means for recognizing a subject to be followed up, which 
recognizes the subject on the basis of an image taken by the 
camera; 

means for recognizing a distance to from the subject 
having been recognized by the means for recognizing a subject 
to be followed up; 

means for controlling movement, which controls said 
moving means so as to keep the distance from said movable robot 
to the subject, having been recognized by said means for 
recognizing a distance to the subject, at a predetermined 
distance; and 

means for controlling the outputting of a sound, which 
outputs a sound or a voice related to the distance to the 
subject . 

2. The apparatus according to Claim 1, wherein said 
moving means of the movable robot moves by two legs 1 walking. 

3. The apparatus according to Claim 1, which further 
comprises means for holding map information, which holds map 
information of an area within which said movable robot moves, 
and 
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wherein said means for controlling movement determines 
the actuation of said moving means based on the map information 
held in said means for holding map information. 

5 4. The apparatus according to Claim 3, wherein a 

restricted area which prohibits approach is set in said map 
information held in said means for holding map information. 

5. The apparatus according to Claim 1, wherein the 
10 subject to be followed up is a person, and which further 

comprises means for judging instruction from a person. 

6. The apparatus according to Claim 5, wherein said 
means for judging instruction from a person judges whether or 

15 not the robot follows up the person based on the results of 
recognition in which the person is recognized from the face 
image . 

7. The apparatus according to Claim 6, wherein said 
20 means . for judging instruction from a person judges the 

instruction from the person based on at least one of posture, 
and gesture of said person. 

8. The apparatus according to Claim 5, wherein said 
25 means for judging instruction from a person judges the 

instruction from the person based on a voice vocalized from 
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said person. 

9. The apparatus according to Claim 1, wherein said 
means for controlling the outputting of a sound changes a volume 

5 of voice outputted to said device for outputting a sound, based 
on a circumferential noise level. 

10. A process for controlling a movable robot 
comprising a camera, moving means, and a device for outputting 

10 a sound, which comprises: 

a step for recognizing a subject to be followed up, which 
recognizes the subject on the basis of an image taken by the 
camera; 

a step for recognizing a distance to the subject having 
15 been recognized by the step for recognizing a subject to be 
followed up; 

a step for controlling movement, which controls said 
moving means so as to keep the distance to the subject having 
been recognized by said step for recognizing a distance to the 
20 subject at a predetermined distance; and 

a step for controlling the outputting of a sound, which 
outputs a sound or a voice related to the distance to the 
subject . 

25 11. A program for controlling a movable robot 

comprising a camera, moving means, and a device for outputting 



41 



a sound: which comprises having a computer to serves as 

means for recognizing a subject to be followed up, which 
recognizes the subject on the basis of an image taken by the 
camera; 

means for recognizing a distance to the subject having 
been recognized by the means for recognizing a subject to be 
followed up; 

means for controlling movement, which controls said 
moving means so as. to keep the distance to the subject having 
been recognized by said means for recognizing a distance to 
the subject at a predetermined distance; and 

means for controlling the outputting of a sound, which 
outputs a sound or a voice related to the distance to the 
subj ect . 
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